Picture for Xiaodan Liang

Xiaodan Liang

Thinking with Geometry: Active Geometry Integration for Spatial Reasoning

Add code
Feb 05, 2026
Viaarxiv icon

Accordion-Thinking: Self-Regulated Step Summaries for Efficient and Readable LLM Reasoning

Add code
Feb 03, 2026
Viaarxiv icon

Order from Chaos: Physical World Understanding from Glitchy Gameplay Videos

Add code
Jan 23, 2026
Viaarxiv icon

ArtiSG: Functional 3D Scene Graph Construction via Human-demonstrated Articulated Objects Manipulation

Add code
Dec 31, 2025
Viaarxiv icon

CARE What Fails: Contrastive Anchored-REflection for Verifiable Multimodal

Add code
Dec 22, 2025
Viaarxiv icon

OmniGen: Unified Multimodal Sensor Generation for Autonomous Driving

Add code
Dec 16, 2025
Viaarxiv icon

DirectSwap: Mask-Free Cross-Identity Training and Benchmarking for Expression-Consistent Video Head Swapping

Add code
Dec 10, 2025
Figure 1 for DirectSwap: Mask-Free Cross-Identity Training and Benchmarking for Expression-Consistent Video Head Swapping
Figure 2 for DirectSwap: Mask-Free Cross-Identity Training and Benchmarking for Expression-Consistent Video Head Swapping
Figure 3 for DirectSwap: Mask-Free Cross-Identity Training and Benchmarking for Expression-Consistent Video Head Swapping
Figure 4 for DirectSwap: Mask-Free Cross-Identity Training and Benchmarking for Expression-Consistent Video Head Swapping
Viaarxiv icon

GLaD: Geometric Latent Distillation for Vision-Language-Action Models

Add code
Dec 10, 2025
Figure 1 for GLaD: Geometric Latent Distillation for Vision-Language-Action Models
Figure 2 for GLaD: Geometric Latent Distillation for Vision-Language-Action Models
Figure 3 for GLaD: Geometric Latent Distillation for Vision-Language-Action Models
Figure 4 for GLaD: Geometric Latent Distillation for Vision-Language-Action Models
Viaarxiv icon

SpatialDreamer: Incentivizing Spatial Reasoning via Active Mental Imagery

Add code
Dec 08, 2025
Viaarxiv icon

Video Spatial Reasoning with Object-Centric 3D Rollout

Add code
Nov 17, 2025
Figure 1 for Video Spatial Reasoning with Object-Centric 3D Rollout
Figure 2 for Video Spatial Reasoning with Object-Centric 3D Rollout
Figure 3 for Video Spatial Reasoning with Object-Centric 3D Rollout
Figure 4 for Video Spatial Reasoning with Object-Centric 3D Rollout
Viaarxiv icon